Learning Robotic Manipulation Tasks via Task Progress Based Gaussian Reward and Loss Adjusted Exploration

نویسندگان

چکیده

Multi-step manipulation tasks in unstructured environments are extremely challenging for a robot to learn. Such interlace high-level reasoning that consists of the expected states can be attained achieve an overall task and low-level decides what actions will yield these states. We propose model-free deep reinforcement learning method learn multi-step tasks. introduce Robotic Manipulation Network (RoManNet) 1 , which is vision-based model architecture, action-value functions predict action candidates. define Task Progress based Gaussian (TPG) reward function computes on lead successful motion primitives progress towards goal. To balance ratio exploration/exploitation, we Loss Adjusted Exploration (LAE) policy determines from candidates according Boltzmann distribution loss estimates. demonstrate effectiveness our approach by training RoManNet several robotic both simulation real-world. Experimental results show outperforms existing methods achieves state-of-the-art performance terms success rate efficiency. The ablation studies TPG LAE especially beneficial like multiple block stacking.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Vision Based Robotic Interception in Industrial Manipulation Tasks

In this paper, a solution is presented for a robotic manipulation problem in industrial settings. The problem is sensing objects on a conveyor belt, identifying the target, planning and tracking an interception trajectory between end effector and the target. Such a problem could be formulated as combining object recognition, tracking and interception. For this purpose, we integrated a vision sy...

متن کامل

the relationship among efl learners’ autonomy, first language essay writing tasks and second language essay writing tasks in task/content based language instruction

the ability of composing a coherent and extended piece of writing in second language is considered as a fundamental factor to convey information and ideas of learners through the academic issues. although learners may achieve a perfect academic writing skill through assigning the l2 tasks in content based instruction, but demonstration of their abilities may be related to their ability in l1 es...

15 صفحه اول

Prediction learning in robotic manipulation

This thesis addresses an important problem in robotic manipulation, which is the ability to predict how objects behave under manipulative actions. This ability is useful for planning of object manipulations. Physics simulators can be used to do this, but they model many kinds of object interactions poorly, and unless there is a precise description of an object’s properties their predictions may...

متن کامل

Task Planning for Robotic Manipulation in Space

Authors' current addresses: A.C. Sanderson, Department of Electrical, Computer and Systems Engineering, Rensselaer Polytechnic Institute, Troy, NY 12180; M.A. Peshkin, Department of Mechanical Engineering, The Technological Institute, Northwestern University, 2145 Sheridan Rd., Evanston, IL 60201; L.S. Homem de Mello, Robotics Institute, Department of Electrical and Computer Engineering, Camegi...

متن کامل

Preschoolers evaluate risk and reward in exploration-exploitation tasks

Children are drivers of their own discovery. To develop a complete characterization of the factors that drive exploration in early childhood, we must first understand how competing factors influence children’s decision making. We investigated preschool-aged children’s decision-making on explore-exploit tasks where the available information about the distribution of rewards was controlled. When ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE robotics and automation letters

سال: 2022

ISSN: ['2377-3766']

DOI: https://doi.org/10.1109/lra.2021.3129833